Search results for "Sentence extraction"

Overview of the Second BUCC Shared Task: Spotting Parallel Sentences in Comparable Corpora

2017

This paper presents the BUCC 2017 shared task on parallel sentence extraction from comparable corpora. It recalls the design of the datasets, presents their final construction and statistics and the methods used to evaluate system results. 13 runs were submitted to the shared task by 4 teams, covering three of the four proposed language pairs: French-English (7 runs), German-English (3 runs), and Chinese-English (3 runs). The best F-scores as measured against the gold standard were 0.84 (German-English), 0.80 (French-English), and 0.43 (Chinese-English). Because of the design of the dataset, in which not all gold parallel sentence pairs are known, these are only minimum values. We examined …

Computer science; Sentence extraction; business.industry; Speech recognition; 020206 networking & telecommunications; 02 engineering and technology; Gold standard (test); Spotting; computer.software_genre; Task (project management); 0202 electrical engineering electronic engineering information engineering; 020201 artificial intelligence & image processing; Artificial intelligence; business; computer; Natural language processing; Sentence

Proceedings of the 10th Workshop on Building and Using Comparable Corpora
researchProduct